issue/634: InfiniCore 支持InfiniLM Llama模型适配 #635

Ceng23333 · 2025-11-20T06:53:08Z

#634
测例编译
xmake build infinicore-test

测例执行
INFINICORE_LOG_LEVEL=info build/linux/x86_64/release/infinicore-test --nvidia --test all

Signed-off-by: Ceng23333 <[email protected]>

PanZezhong1725 · 2025-11-25T07:50:14Z

src/infinicore/context/context_impl.hpp

    Runtime *getCpuRuntime();

+    // Get runtime for a specific device (creates it if it doesn't exist)
+    Runtime *getRuntime(Device device);


这个接口有哪里用过吗？

这个之前调试加的接口，后面没用了

wooway777 · 2025-11-25T06:58:58Z

include/infinicore/nn/rope.hpp

-     * @brief RoPE algorithm type
+     * @brief Frequency generation method for RoPE cache
+     */
+    enum class FreqGen {


为啥要有这个呢？都用Algo行不行？

wooway777 · 2025-11-25T07:51:59Z

include/infinicore/nn/rope.hpp

    RoPE(size_t head_dim,
         size_t max_seq_len,
         double theta = 10000.0,
+         Algo freq_gen = Algo::GPT_J,


所以这个地方又是为什么要两个呢？

rope如果有问题的话需要解决一下

问了gpt，hf确实是这样算的

Signed-off-by: Ceng23333 <[email protected]>

PanZezhong1725 · 2025-11-25T09:34:34Z

xmake.lua


 target("infinicore_c_api")
-    set_kind("phony")
+    set_kind("shared")


单独拆出来一个cpp api

PanZezhong1725 · 2025-11-25T09:40:30Z

src/infinicore/context/runtime/runtime.hpp

    std::unique_ptr<MemoryAllocator> device_memory_allocator_;
    std::unique_ptr<MemoryAllocator> pinned_host_memory_allocator_;
+    // Mutex to protect stream access for thread safety
+    mutable std::mutex stream_mutex_;


不需要这个锁

PanZezhong1725 · 2025-11-25T09:43:23Z

src/infinicore/context/runtime/runtime.cc

-    if (pinned_host_memory_allocator_) {
-        pinned_host_memory_allocator_.reset();
+    // Wrap entire destructor in try-catch to prevent exceptions from causing segfaults
+    try {


写的太复杂

PanZezhong1725 · 2025-11-25T09:48:24Z

src/infinicore/context/runtime/runtime.cc

 }

 void Runtime::memcpyD2H(void *dst, const void *src, size_t size) {
+    SPDLOG_DEBUG("[RUNTIME] memcpyD2H: Called with runtime device: {}", device_.toString());


没必要做这么多检查

PanZezhong1725 · 2025-11-25T09:49:20Z

src/infinicore/tensor/copy.cc

    }
-    if (this->device().getType() == src->device().getType()) {
-        op::rearrange_(Tensor(const_cast<TensorImpl *>(this)->shared_from_this()), src);
+    if (this->device().getType() == src->device().getType() && this->device().getIndex() == src->device().getIndex()) {


检查太多了

Ceng23333 force-pushed the issue/634 branch from bbd9dbc to bba30f0 Compare November 24, 2025 08:05

issue/634: InfiniCore 支持InfiniLM Llama模型适配

283c618

Signed-off-by: Ceng23333 <[email protected]>

Ceng23333 force-pushed the issue/634 branch from 9d6ac0e to 283c618 Compare November 25, 2025 01:32

Ceng23333 requested a review from PanZezhong1725 November 25, 2025 01:34

Ceng23333 marked this pull request as ready for review November 25, 2025 01:34

Ceng23333 requested review from pengcheng888 and wooway777 November 25, 2025 01:35

PanZezhong1725 added 紧急！类型：开发准备好了 labels Nov 25, 2025

Ceng23333 added 3 commits November 25, 2025 13:10

update logic of test case

e066550

Signed-off-by: Ceng23333 <[email protected]>

fix compilation

2f82dad

Signed-off-by: Ceng23333 <[email protected]>

fix compilation

52d3b1b

Signed-off-by: Ceng23333 <[email protected]>

PanZezhong1725 reviewed Nov 25, 2025

View reviewed changes

wooway777 requested changes Nov 25, 2025

View reviewed changes

fix compilation

51b79fb

Signed-off-by: Ceng23333 <[email protected]>

PanZezhong1725 requested changes Nov 25, 2025

View reviewed changes

Ceng23333 marked this pull request as draft November 25, 2025 09:58

Ceng23333 mentioned this pull request Nov 25, 2025

issue/634: InfiniCore 支持InfiniLM Llama模型适配 #668

Open

PanZezhong1725 removed 紧急！类型：开发准备好了 labels Nov 26, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

issue/634: InfiniCore 支持InfiniLM Llama模型适配 #635

issue/634: InfiniCore 支持InfiniLM Llama模型适配 #635

Uh oh!

Ceng23333 commented Nov 20, 2025 •

edited

Loading

Uh oh!

PanZezhong1725 Nov 25, 2025

Uh oh!

Ceng23333 Nov 25, 2025

Uh oh!

wooway777 Nov 25, 2025

Uh oh!

wooway777 Nov 25, 2025

Uh oh!

wooway777 Nov 25, 2025

Uh oh!

Ceng23333 Nov 25, 2025

Uh oh!

PanZezhong1725 Nov 25, 2025

Uh oh!

PanZezhong1725 Nov 25, 2025

Uh oh!

PanZezhong1725 Nov 25, 2025

Uh oh!

PanZezhong1725 Nov 25, 2025

Uh oh!

PanZezhong1725 Nov 25, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

issue/634: InfiniCore 支持InfiniLM Llama模型适配 #635

Are you sure you want to change the base?

issue/634: InfiniCore 支持InfiniLM Llama模型适配 #635

Uh oh!

Conversation

Ceng23333 commented Nov 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Ceng23333 commented Nov 20, 2025 •

edited

Loading